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MODIFIED BACILLUS THURINGIENSIS 
INSECTICIDAL-CRYSTAL PROTEIN GENES AND THEIR 
EXPRESSION IN PLANT CELLS 

This invention provides a modified Bacillus 
thuringiensis ("Bt") gene (the "modified BtlCP gene") 
encoding all or an insecticidally-ef fective portion of 
a Bt insecticidal crystal protein ("ICP") . a plant, 
transformed with the modified Bt ICP gene can show 
higher expression levels of the encoded ICP and 
improved insect-resistance. 

Background of the Invention 

Plant genetic engineering technology has made 
significant progress during the last 10 years. It has 
become possible to introduce stably foreign genes into 
plants . This has provided exciting opportunities for 
modern agriculture. Derivatives of the Ti-plasmid of 
the plant pathogen , Aqrobacterium tune faci ens , have 
proven to be efficient and highly versatile vehicles 
for the introduction of foreign genes into plants and 
plant cells . In addition, a variety of free DNA 
delivery methods , such as electroporation, 
microinjection, pollen-mediated gene transfer and 
particle gun technology , have been developed for the 
same purpose . 

The major aim of plant transformations by genetic 
engineering has been crop improvement . In an initial 
phase, research has been focused on the engineering 
into plants of useful traits such as insect-resistance. 
In this respect , progress in engineering insect 
resistance in transgenic plants has been obtained 
through the use of genes , encoding ICPs, from Bt 
strains (Vaeck et al. , 1987) . A Bt strain is a spore 
forming gram-positive bacterium that produces a 
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parasporal crystal which is composed of crystal 
proteins which are specifically toxic against insect 
larvae. Bt ICPs possess a specific insect icidal 
spectrum and display no toxicity towards other animals 
and humans (Gasser and Fraley, 1989) . Therefore, the Bt 
ICP genes are highly suited for plant engineering 
purposes . 

For more than 20 years, Bt crystal spore 
preparations have been used as biological insecticides. 
The commercial use of Bt sprays has however been 
limited by high production costs and the instability of 
crystal proteins when exposed in the field (Vaeck et 
al. , 1987) . The heterogeneity of Bt strains has been 
well documented . Strains active against Lepidoptera 
(Dulmage et al. , 1981) , Diptera (Goldberg and Margalit, 
1977) and Coleoptera (Krieg et al. , 1983) have been 
described. 

Bt strains produce endogenous crystals upon 
sporulation. Upon ingestion by insect larvae, the 
crystals are solubilized in the alkaline environment of 
the insect midgut giving rise to a protoxin which is 
subsequently proteolytically converted into a toxic 
core fragment or toxin of 60-70 kDa. The toxin causes 
cytolysis of the epithelial midgut cells. The 
specificity of Bt ICPs can be determined by their 
interaction with high-affinity binding sites present on 
insects 1 midgut epithelia. 

The identification of Bt ICPs and the cloning and 
sequencing of Bt ICP genes has been reviewed by Hofte 
and Whiteley (1989) . The Bt ICP genes share a number of 
common properties . They generally encode insecticidal 
proteins of 13 0 kDa to 14 0 kDa or of about 70 kDa, 
which contain toxic fragments of 60 ± 10 kDa (Hofte and 
Whiteley, 1989) . The Bt ICP genes have been classified 
into four major groups according to both their 



WO 91/16432 



PCT/EP9 1/00733 



structural similarities and insecticidal spectra (Hofte 
and Whiteley, 1989} : Lepidoptera-specif ic (Cryl) , 
Lepidoptera- and Diptera-specif ic (Cryll) , Coleoptera- 
specific (Crylll) and Diptera-specif ic (Cry IV) genes. 
5 The Lepidoptera-specific genes (Cryl) all encode 
130-140 kDa proteins . These proteins are generally 
synthesized as protoxins. The toxic domain is localized 
in the N-terminal half of the protoxin. Deletion 
analysis of several Cryl genes confirm that 3 » portions 

10 of the protoxins are not absolutely required for toxic 
activity (Schnepf et al . , 1989). Cry II genes encode 65 
kDa proteins (Widner and Whiteley, 1985) . The Cry II A 
proteins are toxic against both Lepidoptera and Diptera 
while the Cry II B proteins are toxic only to 

15 Lepidopteran insects . The Coleoptera-specif ic genes 
(Cry III) generally encode proteins with a molecular 
weight of about 70 kDa. (Whiteley and Hofte, 1989) . The 
corresponding gene (cry III A) expressed in coli 
directs the synthesis of a 72 kDa protein which is 

20 toxic for the Colorado potato beetle. This 72 kDa 
protein is processed to a 66 kDa protein by spore- 
associated bacterial proteases which remove the first 
57 N-terminal amino acids (Mc Pherson et al. , 1988) . 
Deletion analysis demonstrated that this type of gene 

25 cannot be truncated at its 3 « -end without the loss of 
toxic activity (Hofte and Whiteley, 1989) . Recently, an 
anti-coleopteran strain, which produces a 130 kDa, 
protein has also been described (European patent 
application ("EPA") 89400428.2) . The cry IV class of 

30 crystal protein genes is composed of a heterogenous 
group of Diptera-specif ic crystal protein genes (Hofte 
and Whiteley, 1989) . 

The feasibility of generating insect-resistant 
transgenic crops by using Bt I CPs has been 
demonstrated. (Vaeck et al. , 1987 ; Fischoff et al., 
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1987 and Barton et al. , 1987). Transgenic plants offer 
an attractive alternative and provide an entirely new 
approach to insect control in agriculture which is at 
the same time safe, environmentally attractive and 
cost-effective. (Meeusen and Warren, 1989) . Successful 
insect control has been observed under field conditions 
(Delannay et al. , 1989 ; Meeusen and Warren, 1989) . 

In all cases, Aqrobacterium-mediated gene transfer 
has been used to express china eric Bt ICP genes in 
plants (Vaeck et al. , 1987 ; Barton et al. , 1987 ; 
Fischoff et al. , 1987) . Bt ICP genes were placed under 
the control of a strong promoter capable of directing 
gene expression in plant cells. It is however 
remarkable that expression levels in plant cells were 
high enough only to obtain insect-killing levels of Bt 
ICP genes when truncated genes were used (Vaeck et al . , 
1987 ; Barton et al. , 1987) . None of the transgenic 
plants containing a full-length Bt ICP gene produced 
insect-killing activity. Moreover , Barton et al. (1987) 
showed that tobacco calli transformed with the entire 
Bt ICP coding sequence became necrotic and died. These 
results indicate that the Bt ICP gene presents unusual 
problems that must be overcome to obtain significant 
levels of expression in plants. Even, when using a 
truncated Bt ICP gene for plant transformation , the 
steady state levels of Bt ICP mRNA obtained in 
transgenic plants are very low relative to levels 
produced by both an adjacent NPT II -gene, used as a 
marker , and by other chimeric genes (Barton et al., 
19 87 ; Vaeck et al. , 1987) . Moreover, the Bt ICP mRNA 
cannot be detected by northern blot analysis . Similar 
observations were made by Fischoff et al. (1987) ; they 
reported that the level of Bt ICP mRNA was much lower 
than expected for a chimeric gene expressed from the 
CaMV35S promoter . In other words , the cytoplasmic 
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accumulation of the bt mRNA, and consequently the 
synthesis , the accumulation and thereby the expression 
of the Bt ICP protein in plant cells, are extremely- 
inefficient. By contrast , in microorganisms , it has 
5 been shown that truncated Bt ICP genes are less 
favorable than full-length genes (Adang et al. , 1985) , 
indicating that the inefficient expression is solely 
related to the heterologous expression of Bt ICP genes 
in plants. 

10 The problem of obtaining significant Bt ICP 

expression levels in plant cells seems to be inherent 
and intrinsic to the Bt ICP genes . Furthermore, the 
relatively low and poor expression levels obtained in 
plants appears to be a common phenomenon for all Bt ICP 

15 

genes . 

It is known that there are six steps at which gene 
expression can be controlled in eucaryotes (Darnell, 
1982) : 

20 1) Transcriptional control 

2) RNA processing control 

3) RNA transport control 

4) mRNA degradation control 

5) translational control 

25 6) protein activity control 

For all genes , transcriptional control is 
considered to be of paramount importance (The Molecular 
Biology of the Cell, 1989) . 
3Q In European patent publications ("EP") 385, 962 and 

359, 472 , efforts to modify the codon usage of Bt ICP 
genes to improve their expression in plant cells have 
been reported . However, wholesale (i.e., non-selective) 
changes in codon usage can introduce cryptic regulatory 
signals in a gene, thereby causing problems in one or 
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more of the six steps mentioned above for gene 
expression, and thus inhibiting or interfering with 
transcription and/or translation of the modified 
foreign gene in plant cells. For example, changes in 
5 codon usage can cause differential rates of mRNA 
production, producing instability in the mRNA, so 
produced (e.g. , by exposure of regions of the mRNA, 
unprotected by ribosomes, to attack and degradation by 
cytoplasmic enzymes) . Changes in codon usage also can 
10 inadvertantly cause inhibition or termination of RNA 
polymerase II elongation on the so-modified gene. 

Summary of the Invention 

In accordance with this invention is provided a 
15 process for modifying a foreign gene, particularly a Bt 

I CP gene, whose level and/ or rate of expression in 
plant cells, transformed with the gene, is limited by 
the rate and/ or level of nuclear production of an mRNA 
encoded by the gene ; the process comprises the step of 

20 changing adenine and thymine sequences to corresponding 
guanine and cytosine sequences encoding the same amino 
acids in a plurality of translational codons of the 
gene that would otherwise directly or indirectly cause 
a nuclear event which would negatively control (i.e. , 

25 inhibit or interfere with) transcription, nuclear 
accumulation and/ or nuclear export of the mRNA, 
particularly transcription, quite particularly 
elongation of transcription by RNA polymerase II of the 
plant cells. Preferably, the adenine and thymine 

30 sequences are changed to cytosine and guanine sequences 
in translational codons of at least one region of the 
gene which, during transcription, would otherwise have 
thereon a relatively low percentage of RNA polymerase 

II as compared to another adj acent upstream (i.e., 5 ' ) 
region of the gene. 
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Also in accordance with this invention is provided 
the modified Bt ICP gene resulting from the process. 

Further in accordance with this invention, a 
process is provided for improving the resistance of a 
5 plant against insect pests by transforming the plant 
cell genome with at least one modified Bt ICP gene. 

This invention also relates to a chimaeric gene 
that can be used to transform plant cells and that 
contains the following operably-1 inked DNA fragments in 
10 the same transcriptional unit: 

1) the modified Bt ICP gene; 

2) a promoter suitable for directing transcription 
of the modified Bt ICP gene in the plant cells; 

15 and 

3) suitable transcript 3 1 end formation and 
polyadenylation signals for expressing the 
modified Bt ICP gene in the plant cells. 

This invention further relates to: 

20 

- a cell of a plant, the nuclear genome of which 
has been transformed to contain, preferably stably 
integrated therein, the modified Bt ICP gene, 
particularly the chimaeric gene; 

25 - cell cultures consisting of the plant cell ; 

- a plant which is regenerated from the 
transformed plant cell or is produced from the 
so-regenerated plant, the genome of which contains 
the modified Bt ICP gene, particularly the 

30 chimaeric gene, and which shows improved 

resistance to insect pests ; 

- seeds of the plant; and 

- a vector for stably transforming the nuclear 
genome of plant cells with the modified Bt ICP 
gene, particularly the chimaeric gene. 



Detailed Description of the Invention 

As used herein, "Bt ICP K should be understood as 
an intact protein or a part thereof which has 
insecticidal activity and which can be produced in 
nature by B. thurinqiensis . A Bt I CP can be a pro toxin, 
as well as an active toxin or other insecticidal 
truncated part of a protoxin which need not be 
crystalline and which need not be a naturally occurring 
protein. An example of a Bt ICP is a Bt2 insecticidal 
crystal protein (Hofte et al. , 1986) , as well as its 
insecticidally effective parts which are truncated at 
its c- and/or N-terminal ends towards its tryspsin 
cleavage site(s) and preferably having a molecular 
weight of 60-80 kDa. Other examples of Bt I CPs are: 
Bt2, Bt3, Bt4, Btl3, Btl4, Btl5, BtlS , Bt21, Bt22, 
Bt73, Bt208, Bt245, BtI260 and BtI109P as disclosed in 
PCT publications W090/15139 and WO90/09445, in Hofte 
and Whiteley (1989) and in EPA 90403724.9. 

As used herein , "protoxin" should be understood as 
the primary translation product of a full-length gene 
encoding a Bt ICP. 

As used herein, "toxin" or "active toxin" or 
"toxic core" should all be understood as a part of a 
protoxin which can be obtained by protease (e.g. , by 
trypsin) cleavage and has insecticidal activity. 

As used herein, "truncated Bt gene" should be 
understood as a fragment of a full-length Bt gene which 
still encodes at least the toxic part of the Bt ICP, 
preferentially the toxin. 

As used herein, "modified Bt ICP gene" should be 
understood as a DNA sequence which encodes a Bt ICP, 
and in which the content of adenine ("A") and thymine 
("T") has been changed to guanine ("G") and cy to sine 
("C") in codons , preferably at least 3 , in at least one 
region of the DNA sequence without affecting the 



original amino acid sequence of the Bt ICP. Preferably 
in at least two regions, especially in at least three 
regions, of the DNA sequence, the A and T content is 
changed to G and C in at least 3 codons . For regions 
downstream of the translation initiation site of the 
DNA sequence , it is preferred that the A-T content of 
at least about 10 codons , particularly at least about 
33 codons , be changed to G-C. 

By "region" of a modified Bt ICP gene is meant any 
sequence encoding at least three translational codons 
which affect expression of the gene in plants . 

In accordance with this invention, it has been 
shown by means of mRNA turn-over studies that the 
expression pathway of a Bt ICP gene, such as bt2, bt!4 , 
bt!5 and btl8, is specifically inhibited at the nuclear 
level in plant cells. In a further analysis, nuclei of 
transgenic tobacco plants, i.e. , N28 - 220 (Vaeck et 
al. , 1987) , were used in a nuclear run-on assay to 
determine the distribution and the relative efficiency 
of RNA polymerase II complexes to initiate 
transcription of chimaeric Bt ICP plant genes. In this 
regard, the run-on assay has been used to determine 
initially the relative efficiency of RNA polymerase II 
complexes to initiate transcription of Bt ICP genes and 
thereafter to determine the relative distribution and 
migration efficiency of the RNA polymerase II complexes 
on the Bt ICP genes. 

N28 - 220 contains the bt884 fragment under 
control of the TR 2 ' promoter as a chimaeric gene. 
Bt884 is a 5' fragment of the bt2 gene (Hofte et al. , 
1986) up to codon 610 (Vaeck et al. , 1987) . Using 
nuclear run-on analysis, isolated nuclei of N28 - 220 
were incubated with highly labeled radioactive RNA 
precursors , so that the RNA transcripts being 
synthesized at the time became radioactively labeled. 



The RNA polymerase II molecules caught in the act of 
transcription in the cell continue elongating the same 
RNA molecules in vitro. 

The nuclear run-on assays of nuclei of N28 - 220 
culture (non- induced cells and induced cells, TRl'- neo , 
TR2 ' -bt884) revealed that transcription from the TR1 ' 
and TR2 1 promoters is about equally efficient. This 
implies that the low Bt I CP (i.e. , BtB84) expression 
levels are not due to a specifically reduced 
transcriptional activity of the TR2 ' promoter . However, 
nuclear run-on analysis with N28 - 220 nuclei indicated 
that transcription elongation of the nascent Bt ICP 
mRNA is impaired somewhere between 700 to 1000 
nucleotides downstream of the start of transcription. 
This means that RNA polymerase II is not able to 
transcribe the Bt ICP coding sequence with 100 % 
efficiency. Filter binding assays using labeled Bt DNA 
fragments spanning this region and protein extract 
prepared from tobacco nuclei reveal that this DNA 
region undergoes specific interactions with proteins 
present in nuclei. These interactions are the prime 
candidates that cause or affect the impaired elongation 
of transcription by RNA polymerase II through this 
region. By modification of this region to abolish 
specific protein binding, Bt ICP expression levels will 
increase . However, other mechanisms responsible for 
impaired elongation in this region cannot be excluded. 

Further in accordance with this invention, 
sequences within the coding region involved in negative 
control of cytoplasmic Bt ICP mRNA levels have been 
identified by deletion analysis. To this end, 24 
deletion derivatives of pVE36 have been constructed . 
Three main types of deletion mutants have been 
constructed (see fig. 3) : 
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- 5 • end deletions- 

- 3 1 end deletions 

- internal deletions. 

5 The expression of a mutant hybrid bt2-neo gene 

(encoding a fusion protein of Bt2 (Hofte et al. , 1986) 
and NPTII) has been studied by means of transient 
expression experiments using the cat gene as a 
reference. To this end, the neo mRNA levels were 

10 measured in relation to cat mRNA levels in RNA extracts 
of SRI protoplasts. The ratio between the neo and cat 
mRNA level was used to quantify on a relative basis the 
nptll transcript (i.e. , mRNA) levels produced by the 
different constructions . These experiments show that 

15 progressive deletions of the carboxy-terminal (i. e. , 
3 • ) part or the amino- terminal (i. e. , 5') part of the 
Bt I CP coding sequence result in a gradual increase of 
the nptll transcript level. Furthermore , since the 
changes in transcript levels are not very abrupt, these 

20 results suggest that the low transcript levels produced 
by Bt ICP genes are not controlled by a single factor. 
Nevertheless, individual modifications of bt2 coding 
sequence can significantly reduce the interference 
and/ or inhibition of the expression of the mRNA encoded 

25 by Bt ICP genes in plant cells at the level of 
transcript elongation, nuclear accumulation and nuclear 
export . The modification (s) may also affect cytoplasmic 
regulation and metabolism of such mRNAs and their 
translation. 

30 Deletion analysis clearly indicates that several 

internal sequences , located within the Bt ICP coding 
region, might be involved in the negative regulation of 
the Bt ICP expression. By way of example, a 326 bp 
region (fig. 6b) was identified in the bt2 gene that is 
involved in the negative control of BT ICP expression 
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and that is located between nucleotide position 674 and 
nucleotide position 1000, particularly a 268 bp region 
between nucleotide positions 733 and 1000, quite 
particularly a 29 bp region between nucleotide 
positions 765 and 794 which carries two perfect CCAAT 
boxes which are known to be able to cause a reduction 
in elongation efficiency and termination of 
transcription by RNA polymerase II in animal systems 
(Connelly and Manley, 1989) . This internal gene 
fragment or inhibitory zone may itself comprise a 
plurality of inhibitory zones which reduce Bt ICP 
expression levels or which interact directly or 
indirectly with other zones to inhibit or interfere 
with expression. Codon usage of this inhibitory zone 
has been modified in a second step by substituting A - 
T with G - C without affecting the amino acid sequence. 
In this regard , this internal 326 bp fragment (fig. 6b) 
has been replaced with a modified Bt ICP fragment of 
this invention containing 59 modified codons . The 
effect of such modification of this inhibitory zone on 
Bt ICP expression has been analyzed both in transient 
and stable plant trans formants . The results show that 
such .modification of codon usage causes a significant 
increase of Bt ICP expression levels and hence improved 
insect-res istance . 

In addition, N-terminal deletion mutants of the 
bt2 gene have been made by deleting the first N- 
terminal 28 amino acids (Hofte et al. , 1986). It is 
known for the bt2 gene that the first 28 codons can be 
deleted without loss of toxicity (Hofte et al. , 1986; 
Vaeck et al. , 1987) . Also, codon usage for three 
codons , 29 to 31, has been changed in accordance with 
this invention by replacing A - T with G - c without 
affecting the amino acid sequence. Furthermore, an 
optimal translation initiation (ATG) site was created 
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based on the consensus sequence of Joshi (1987) as 

shown in fig. 6a. Plants transformed with this modified 
Bt I CP gene show significantly higher Bt ICP expression 
levels. 

In accordance with this invention, all or part of 
a modified Bt ICP gene of the invention can be stably 
inserted in a conventional manner into the nuclear 
genome of a plant cell, and the so-tr ans formed plant 
cell can be used to produce a transgenic plant showing 
improved expression of the Bt ICP gene . In this regard , 
a disarmed Ti-plasmid, containing the modified Bt ICP 
gene, in Agrobacterium (e.g., turoefaciens) can be 
used to transform a plant cell using the procedures 
described , for example, in EP 116,718 and EP 270, 822, 
PCT publication 84/02913, EPA 87400544.0 and Gould et 
al . (1991) (which are incorporated herein by 
reference) . Preferred Ti-plasmid vectors contain the 
foreign DNA sequence between the border sequence, or at 
least located to the left of the right border sequence, 
of the T-DNA of the Ti-plasmid. Of course, other types 
of vectors can be used to transform the plant cell, 
using procedures such as direct gene transfer (as 
described, for example, in EP 233,247) , pollen mediated 
transformation (as described, for example, in EP 
270,356, PCT publication WO 85/01856, and US patent 
4,684, 611) , plant RNA virus-mediated transformation (as 
described , for example , in EP 67, 553 and US patent 
4,407,956) , 1 iposome-mediated transformation (as 
described, for example, in US patent 4 ,536, 475) and 
other methods such as the recently described methods 
for transforming certain lines of corn (Fromm et al. , 
1990; Gordon-Kamm et al. , 1990). 

Preferably, the modified Bt ICP gene is inserted 
in a plant genome downstream of, and under the control 
of, a promoter which can direct the expression of the 
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gene in the plant cells. Preferred promoters include, 
but are not limited to, the strong constitutive 35S 
promoter (Odell et al. , 1985) of cauliflower mosaic 
virus; 35S promoters have been obtained from different 
isolates (Hull and Howell, Virology 86, 482-493 
(1987) ) . Other preferred promoters include the TR1 ' 
promoter and the TR2 ' promoter (Velten et al. , 1984) . 
Alternatively, a promoter can be utilized which is not 
constitutive but rather is specific for one or more 
tissues or organs. For example, the modified Bt ICP 
gene can be selectively expressed in the green tissues 
of a plant by placing the gene under the control of a 
light- inducible promoter such as the promoter of the 
ribulose - 1,5 - phosphate - carboxylase small subunit 
gene as described in EPA 86300291.1. Another 
alternative is to use a promoter whose expression is 
inducible by temperature or chemical factors . 

It is also preferred that the modified Bt ICP gene 
be inserted upstream of suitable 3 1 transcription 
regulation signals (i.e. , transcript 3 ' end formation 
and polyadenylation signals) such as the 3 ' 
untranslated end of the octopine synthase gene (Gielen 
et al. , 1984) or T-DNA gene .7 (Velten and Schell , 
1985) . 

The resulting transformed plant of this invention 
shows improved expression of the modified Bt ICP gene 
and hence is characterized by the production of high 
levels of Bt ICP. Such a plant can be used in a 
conventional breeding scheme to produce more 
transformed plants with the same improved insect- 
resistance characteristics or to introduce the modified 
Bt ICP gene into other varieties of the same or related 
plant species . Seeds , which are obtained from the 
transformed plants , contain the modified BtlCP gene as 
a stable genomic insert . 
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Furthermore, at least two modified BtlCP genes, 
coding for two non-co21pet.it ively binding anti- 
Lepidopteran or anti-Coleopteran Bt ICPs, can be cloned 
into a plant expression vector (EPA 89401499 .2) . Plants 
transformed with such a vector are characterized by the 
simultaneous expression of at least two modified BtlCP 
genes . The resulting transgenic plant is particularly 
useful to prevent or delay development of resistance to 
Bt I CP of insects feeding on the plant. 

The following Examples illustrate the invention 
and are not intended to limit its scope . The Figures, 
referred to in the Examples, are as follows: 

Fig. 1 — Comparison of the transcription initiation 
frequency of RNA polymerase II complexes in nuclei of 
N28-220. Hybridisation efficiencies of labeled nptll 
mRNA and Bt I CP mRNA with their complementary DNA 
counterparts present on a Southern blot were compared . 
DNA fragments were obtained from a digest of plasmid 
pGSH163. A schematic view of the region is given. The 
lengths of the fragments blotted on Hybond-N filter 
(1) , the homologous genes on plasmid pGSK163 (2) , and 
the densitometric values (3) are as follows: 

Digest: 12 3 

BamHI/Hindlll 2358 neo 12386 

1695 bt2 6565 

154 bt2 

6250 vector 

Fig. 2a — Determination of the distribution of the RNA 
polymerase II complexes on the Bt ICP coding sequence 
in nuclei of N28-220 . The hybridisation of labeled RNA 
prepared by nuclear run on with DNA fragments of the Bt 
ICP coding sequence was guantitated. The restriction 
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fragments and scanning values are given in the table 
and figure. The scanning value is proportional to "X" , 
the size of the DNA fragment and the #' UTP per RNA 
fragment hybridising. "X" is directly proportional to 
the number of RNA polymerases passing through the DNA 
fragment . "X" is proportional to the scanning value 
divided by the number of OTPs. The X values of the 
different restriction fragments are shown in the 
figure. In this regard, conversion of the different 
densitometric values into relative hybridisation 
efficiencies by normalising the values of the number of 
dATPs present in the DNA fragment, complementary to the 
hybridising RNA, generates the value "X". "X" is a 
relative measure of the number and the length of the 
extension of the transcripts. "X" thus reflects the 
number of RNA polymerases transcribing a specific DNA 
sequence and their elongation rate. DNA fragments 
present on the Southern digests of plasmid DNA of plant 
vector pGSH163 each have the following lengths of 
fragments blotted on Hybond-N filter (1) , homologous 
genes on plasmid pGSH163 (2) and densitometric values 
(3): 



30 



Digest : 


1 


2 


3 


BamHI/EcoRI 


8877 


neo 


15333 




726 


bt2(2) 


2926 




583 


bt2{3) 


635 




271 


bt2(l) 
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1695 bt2 6565 

154 

BamHI/SacI 8053 neo 14194 

1353 bt2(l) 4572 

1051 bt2(2> 615 

Xmnl 4973 neo 13219 

2107 
1401 

729 bt2 (3) 736 

628 bt2(2) 1817 

305 bt2(4) 

188 bt2(5) 

120 bt2(l) 

Fig. 2b — Schematic view of nine bt88 4 DNA fragments 
that were inserted into the poly linker of Ml 3 vectors , 
MP18 and MF19 (Yanisch-Perron et al. , 1985) . The Bt I CP 
coding sequence is shown from AUG to 1600 nucleotides 
downstream. The relevant restriction sites and sizes of 
the DNA fragments are indicated. The nucleotide 
numbering is relative to the AUG. The subclones were 
named pJD71, pJD72, pJD73, etc. (to pJD79) , as 
indicated. The inserts were oriented into the M13 
vector such that single standed M13 carried the 
fragments of the Bt ICP coding sequence in an anti- 
sense orientation. 

Fig. 2c — Schematic representation of three nuclear 
run-on analyses with N28-220 nuclei as described by Cox 
and Goldberg (1988) . Assays were performed for periods 
of 5, 10 and 30 minutes . The labeled nuclear RNA was 
allowed to hybridize with 5 /jg of single stranded 
pJD71-pJD79 and MP18 DNA, which were immobilised on 
nylon membranes . The membranes were autoradiographed , 
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and densitometric values were obtained by scanning the 
autoradiography . The abscissa shows the nucleotide 
position relative to the AUG of the Bt (i.e., bt2) 
coding sequence. The center of each of the single 
5 stranded Bt DNA fragments is indicated in the graph . 
The ordinate gives the relative hybridisation signal 
for each fragment corrected for the number of dATPs in 
the fragment and adjusted to 100% for the value of 
pJD71 for each of the three incubation periods. All 

10 values are corrected for non-specific hybridisation to 
single stranded MP18 DNA. The relative values are a 
measure for the reactivation of bt mRNA synthesis by 
RNA polymerase II. The assay does not distinguish 
between the number of mRNA extensions and the length of 

15 mRNA extensions . 

Fig. 3 — Construction of deletion mutants of the 
bt860-neo gene to measure the effect on cytoplasmic Bt 
I CP mRNA levels. The parental vector pVE36 is shown. 
20 The following deletion mutants were generated: 

1. PJD50 : . pJD50 was derived from pVE36 by digesting 

with BamHI and SphI . The 5 1 and 3 1 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
25 was ligated and then used to transform 

MC1061 cells. Trans formants were selected 
for amp r phenotype . 

2 . PJD51 : pJD51 was derived from pVE36 by digesting 

with Spel and SphI . The 5 1 and 3 1 
30 protruding ends were filled in with Klenow 

DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transf ormants were selected 
for amp r phenotype . 
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pjD52 was derived from pVE36 by digesting 
with EcoRV and Sphl. The 5 'and 3 e 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme . The treated DNA 
was ligated and then used to transform 
MC1061 cells. Trans formants were selected 
for amp r phenotype . 

pjD53 was derived from pVE36 by digesting 
with Xcal and Sphl . The 3 ' protruding ends 
were filled in with Klenow DNA polymerase 
I enzyme. The treated DNA was ligated and 
then used to transform MCI 061 cells. 
Trans formants were selected for amp r 
phenotype . 

pJD54 was derived from pVE36 by digesting 
with Aflll and Sphl. The 5 * and 3 * 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Trans formants were selected 
for amp r phenotype. 

pJD55 was derived from pVE36 by digesting 
with Clal and Sphl. The 5 ' and 3 • 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Trans formants were selected 
for amp r phenotype . 

pjD56 was derived from pVZ36 by digesting 
with Xhol and Sphl. The 5' and 3 * 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 
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8- PJD57: pJD57 was derived from pVE36 by digesting 
with Af III and BamHI. The 5' and 3 ' 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Trans formants were selected 
for amp 1 " phenotype . 

9 . PJD58 : pJD58 was derived from pVE36 by digesting 

with Xcal and BamHI . The 5 ' protruding 
ends were filled in with Klenow DNA 
polymerase I enzyme . The treated DNA was 
ligated and then used to transform MC1061 
cells . Trans formants were selected for 
amp r phenotype . 

10. PJD59: pJD59 was derived from pVE36 by digesting 

with EcoRV and BamHI. The 5 ' protruding 
ends were filled in with Klenow DNA 
polymerase I enzyme. The treated DNA was 
ligated and then used to transform MC1061 
cells. Trans formants were selected for 
amp r phenotype. 

11. PJD6 0 : pJD60 was derived from pVE36 by digesting 

with Spel and BamHI. The 5 ' protruding 
ends were filled in with Klenow DNA 
polymerase I enzyme . The treated DNA was 
ligated and then used to transform MC1061 
cells. Trans formants were selected for 
amp r phenotype. 

12. PJD61: PJD6.1 was derived from PJD50 . PVE36 was 

digested with Xbal and filled in with 
Klenow polymerase I . PJD50 was linearized 
with BamHI and filled in with Klenow 
polymerase I. The 375bp Xbal fragment of 
PVE3 6 was ligated in the filled in BamHI 
of pJD50. The ligation mixture was used to 
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transform MC1061 cells. Transformants were 
selected for amp r phenotype. 

13 . PJD62 : PJD62 was derived from PJD50. PVE36 was 

digested with Xcal and EcoRV. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 367bp Xcal -EcoRV 
fragment of PVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform MC1061 
cells. Transformants were selected for 
amp r phenotype. 

14. PJD63 : PJD63 was derived from PJD50. PVE36 was 

digested with Xcal and EcoRV. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 474bp Xcal -EcoRV 
fragment of PVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform MC1061 
cells. Transformants were selected for 
amp r phenotype. 

15. PJD64 : PJD64 was derived from PJD50. PVE3 6 was 

digested with EcoRI and EcoRV and filled 
in with Klenow polymerase I. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 458bp EcoRI -EcoRV 
fragment of PVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform MC1061 
cells. Transformants were selected for 
amp r phenotype. 

16. PJD65 : PJD65 was derived from PJD50. PVE3 6 was 

digested with EcoRI and Xbal and filled in 
with Klenow polymerase I. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 327bp EcoRI-Xbal 
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fragment of PVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform MC1061 
cells. Trans fonnants were selected for 
amp r phenotype . 

17 . PJD66: PJD66 was derived from PJD50 . PVE36 was 

digested with Spel and Xcal and filled in 
with Klenow polymerase I. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 1021bp Spel-Xcal 
fragment of PVE36 was ligated in the 
filled in BamHI of pJDSO. The ligation 
mixture was used to transform MC1061 
cells. Trans fonnants were selected for 
amp r phenotype. 

18. PPS56D1: PPS56D1 was derived from PJD56 by 

digesting with EcoRV . The treated DNA was 
ligated and then used to transform MC1061 
cells. Trans f ormants were selected for 
amp r phenotype. 

19. PPS56D2: PPS56D2 was derived from PJD56 by 

digesting with Xcal and Aflll. The 5' 
protruding ends were filled in with Klenow 
polymerase I. The treated DNA was ligated 
and then used to transform MC1061 cells. 
Transformants were selected for amp r 
phenotype. 

20. PPS56D3: PPS56D3 was derived from PJD56 by 

digesting with Spel and EcoRV. The 5' 
protruding ends were filled in with Klenow 
polymerase I. The treated DNA was ligated 
and then used to transform MC1061 cells. 
Transformants were selected for amp r 
phenotype . 



21. PPS56D4; PPS56D4 was derived from PJD56 by 

digesting with Xcal and partially with 
EcoRV. The treated DNA was ligated and 
then used to transform HC1061 cells. 
Transformants were selected for a»p r 
phenotype. 

22. PPS56D6: PPS56D6 was derived from PJD56 by 

digesting with Spel and partially with 
EcoRV . The 5» protruding ends were filled 
in with Klenow polymerase I. The treated 
DNA was ligated and then used to transform 
MCI 061 cells. Transformants were selected 
for amp r phenotype . 

23. PPS56D7 : PPS56D7 was derived from PJD56 by 

digesting with Spel and Xcal. The 5» 
protruding ends were filled in with Klenow 
polymerase I. The treated DNA was ligated 
and then used to transform MC1061 cells. 
Transformants were selected for amp r 
phenotype. 

24. PPS56D8 : PPS56D8 was derived from PPS56D2 by 

digesting with Spel and partially with 
EcoRV . The 5* protruding ends were filled 
in with Klenow polymerase I. The treated 
DNA was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype . 

Fig. 4 — Effect of deletions in the Bt I CP coding 
sequence on cytoplasmic Bt ICP mRNA levels. The 
cytoplasmic mRNA levels specified by the invariable cat 
reference gene and the different Bt ICP deletion 
mutants described in fig. 3 are listed in the table. 
The measurements were converted into relative Bt ICP 
mRNA abundances . Bt ICP and cat mRNA quantizations were 



done as described by Cornel issen (1989). Total RNA was 
slot blotted and hybridised with radi oactively labeled 
RNA complementary to the neo snd cat coding sequences. 
Values were quant ita ted with the aid of calibration 
curves of cold cat and Bt ICP riboprobe transcripts. 

Fig. 5 — Relative transcript levels produced by the 
deletion derivatives of pVE36. 

Fig. 6a — Schematic presentation of the synthetic DNA 
sequences used to introduce a N-terminal deletion and a 
change of the codons 29, 30 and 31 of the bt2 coding 
sequence. The oligo nucleotides were annealed according 
to Engler et al . (1988) and cloned into the BstXI 
restriction site of plasmid pVE36, yielding pPS027. The 
7360 bp fragment of pPS027 was ligated to the the 1177 
bp Clal restriction fragment of pVE3 6, yielding plasmid 
PPS028. pPS02 8 is identical to pVE36 apart for the N- 
terminal modification. 

Fig. 6b — Schematic presentation of the synthetic DNA 
sequences used to introduce an internal modification 
into the bt2 coding sequence . The oligonucleotides were 
annealed and ligated as described by Engler et al. 
(1988) and the resulting concatemeric DNA fragment was 
cut with the restriction enzymes Xbal and EcoRI to 
release the modified 327 bp Xbal -EcoRI restriction 
fragment. This fragment was ligated into the 3530 bp 
EcoRI -Xbal fragment of pPS023 which is a pUC19 
derivative ( Yan isch-Perron et al. , 1985) that carries 
the 1533 bp Aflll (filled in) BamHI fragment of pVE36 
in the Hindlll (filled in) BamHI site of pUC19, 
resulting in plasmid pPS024. Plasmid pPS024 was 
linearised by digestion with restriction enzyme Xbal 
and the 375 bp Xbal restriction fragment of pPS023 was 
introduced resulting in pPS025. The 1177 bp Clal 
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fragment of pPS025 was introduced in the 7360 bp Clal 
restriction fragment of pPS027 yielding pPS029. pPS029 
is identical to pVE36 but carries both the amino- 
tenninal modification and the internal modification of 
5 the Bt I CP coding sequence. 

Fig. 6c — Nucleotide sequences 800 to 4000 of the 
plasmids pVE36 and pPS029. "x" refers to not known 
nucleotides. 

10 

Fig. 7 — Schematic presentation of the effect of the 
mutations on the AT content of the Bt I CP plant gene. 
The modified regions are indicated. 

Fig. 8a — Schematic presentation of the plasmid 
15 constructions used in the transient expression assay. 
The relevant genes are indicated. 

Fig. 8b — Accumulation profiles of CAT (Neumann et 
al. , 1987) and the modified BtlCP (Engvall and Pesce, 
20 1978) in a typical transient expression assay. 

Unless otherwise stated in the Examples, all 
procedures for making and manipulating recombinant DNA 
are carried out by the standardized procedures 
described in Sambrook et al . , Molecular Cloning - A 

25 

laboratory Manual , Cold Spring Harbor Laboratory 
(1989) . 

Example 1. Determination of the Efficiency of 
Transcription initiation 

30 The relative efficiency of RNA polymerase II 

complexes to initiate transcription at chimaeric BtlCP 
plant genes was studied, using transgenic plant N28-220 
which is described by Vaeck et al. (1987) and contains 
copies of the T-DNA of plasmid pGSH163 This T-DNA 
carries the chimaeric plant genes P TR2 bt8843 ' g7 and 



P^.neoS 'qcs. Nuclei of 25 g of induced leaves of 
N28-220 were prepared according to Cox and Goldberg 
(1988) and stored the nuclei at a temperature of -70°C. 
This method causes the nascent precursor mRNA chains 
and the RNA polymerase II complexes to halt while the 
complexes remain associated at the DNA. A batch of 
these nuclei was assayed for the ability to incorporate 
radioactively labeled UTP as a measure for the 
transcript ional viability of the nuclei (Cox and 
Goldberg (1988) . This incorporation could be 
successfully repressed by addition of a-amanitin to a 
final concentration of 2 pq/w.1. This shows that the UTP 
incorporat i on was due to transcript elongation by RNA 
polymerase II and that RNA synthesis on the protein 
coding genes which are occupied by RNA polymerase II 
can be reactivated under the appropriate experimental 
conditions . 

Batches of the nuclei of N28-220 were used to 
synthesize radioactively labeled RNA as described by 
Cox and Goldberg (1988) . The radioactive RNA 
synthesized is a direct representation of the 
distribution of the RNA polymerases II complexes on the 
DNA in the nuclei. As the DNA of N28-220 carries two 
genes which can be assayed, namely the chimaeric neo 
gene and the chimaeric Bt I CP gene, it is possible to 
compare the distribution of RNA polymerase II complexes 
on these two genes. To this end, the radioactive RNA 
was extracted from the nuclei according to Cox and 
Goldberg (1988) and used as a probe in a conventional 
Southern hybridisation . The Southern blot contained DNA 
fragments carrying the Bt I CP and neo coding sequences 
in a molar excess relative to the neo and Bt I CP RNA 
species present in the radioactive probe. A detailed 
description of the Southern blot is given in fig. l. 
The hybridisation experiment resulted in hybridisation 



signals to both the neo and Bt ICP coding sequences 
(fig. 1} . Densitometry c scanning showed that the 
intensity of the hybridisation signal to the neo and Bt 
ICP coding regions was nearly identical. This result 
implies that the number of transcripts initiating from 
the TR dual promoter is about similar in both 
directions. As in plant N28-220 the cytoplasmic neo 
mRNA level is several magnitudes higher than that of Bt 
ICP; this shows that the Bt ICP coding sequence indeed 
negatively controls accumulation of cytoplasmic Bt ICP 
mRNA, but that this phenomenon is not due to a dominant 
negative effect on transcription initiation of the 
chimaeric Bt ICP plant gene. 

Example 2 . Transcription Elongation 

The relative distribution of RNA polymerase II 
complexes on the Bt ICP plant genes present in 
transgenic plant N28-220 which is described by Vaeck et 
al. (1987) was investigated. To this end, a second 
experiment was carried out with batches of the nuclei 
of N28-220 described in Example 1. 

The nuclei were incubated as described by Cox and 
Goldberg (1988) to synthesize radioactively labeled 
RNA. The radioactive RNA was extracted as described 
previously to provide a probe for a Southern 
hybridisation. The Southern blot prepared for this 
experiment contained several fragments of the Bt ICP 
coding sequence in molar excess relative to the 
complementary RNA present in the probe. The rationale 
of the experiment was that if the RNA polymerase II 
complexes were equally distributed over the Bt ICP 
coding region, the hybridisation with the different Bt 
ICP DNA fragments present on the Southern blot would be 
proportional to the size and dATP content of the 
different fragments . A detailed description of the DNA 



fragments present on the Southern is given in fig 2a. 
The hybridisation of the radioactive RNA extracted from 
the nuclei of N28-22Q with the Southern revealed that 
the complete Bt ICP coding sequence as present in 
N28-220 is transcribed by RNA polymerase II. 

Quantification of the hybridisation signals by 
densitometric scanning of the autoradiogram showed that 
more radioactively labeled RNA was hybridising with DNA 
fragments representing Bt ICP sequences located 5' on 
the Bt ICP coding sequence than with Bt ICP sequences 
located 3 s on the Bt ICP coding sequence. The actual 
values are given in fig 2a. This in vitro experiment 
demonstrates that in vivo the RNA polymerases are not 
evenly distributed over the Bt ICP coding sequence. 

The site(s) involved in reducing the RNA 
polymerase II elongation were then determined more 
accurately . Nine M13 derivatives were made that carry 
overlapping fragments of the Bt2 coding sequence 
spanning the region from the AUG to 1584 nucleotides 
downstream. The inserts were oriented into the vector 
such that, in single stranded Ml 3 derivatives, the Bt 
sequences were complementary to the Bt transcript. A 
schematic view of the Ml 3 clones is given in fig. 2b. 

A molar excess of each single stranded anti-Bt DNA 
was bound to nylon filters to serve as a DNA target for 
hybridisation with labeled RNA prepared from nuclear 
run-on assays with N28-220 nuclei as described by Cox 
and Goldberg (1988) . Three nuclear run-ons that 
differed only in their time period of incubation were 
carried out simultaneously . The incubation time 
determines the length of extension of the nascent mRNA 
chain. Shorter incubation periods give a more accurate 
view of the position of the RNA polymerase II complexes 
relative to the substrate DNA and their ability to 
elongate at the moment of the start of incubation. 



Hence, the shorter the in vitro incubation period, the 

more accurate the view in predicting the in vivo 
situation. 

The results are shown in fig. 2c. The data for the 
5 minute incubation show that, in vivo , at a very 
discrete inhibitory zone along the bt2 coding sequence, 
one or more factors interfere with transcript 
elongation and that such factor (s) remain present in 
such inhibitory zone during the course of the in vitro 
mRNA extension reaction. Increased incubation periods 
show that, on a subset of DNA templates, RNA synthesis 
resumed downstream of such inhibitory zone in this 
assay without significantly removing the inhibition in 
the inhibitory zone itself. In this regard, the data 
indicate that: 

1. The inhibitory zone causes the RNA polymerases to 
pause and not to terminate. 

2. This pause is only transitory for a small fraction 
of the Bt DNA templates which were used. 

3. The continued RNA polymerase elongation, 
downstream of the inhibitory zone, is done by a 
large number of polymerases on the relatively 
small fraction of the Bt DNA templates. 

It is believed, therefore, that low cytoplasmic Bt 
mRNA levels are due at least in part to inefficient 
production of precursor mRNA caused by inefficient 
elongation of a nascent transcript and/ or stalling of 
RNA polymerase II complexes from transcribing at an 
inhibitory zone. 

The inhibitory zone was assayed for its ability to 
interact with proteins present in nuclei of tobacco 
protoplasts . A crude nuclear extract was prepared from 
tobacco SRI leaf protoplasts according to Luthe and 
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Quatrano (1980) and used for filter binding assay 
essentially as described by Diffley and Stillman 
(1986) . 100 ng samples of protein extract were mixed 
with different amounts of radioactively labeled 532 bp 
5 Xbal-AccI bt884 DNA fragment , ranging from 0 to 1670 
picomolar, in a final volume of 0.150 ml binding buffer 
(10 mM Tris pH 7.5, 50 bK NaCl, 1 mM DTT, 1 mM EDTA and 
5% glycerol) . After 45 minutes incubation at room 
temperature, the samples were filtered through an 

10 alkali -washed nitrocellulose membrane and washed twice 
with 0. 150 ml of an ice-cold solution containing 10 mM 
Tris pH 7.5, 50 mM NaCl and 1 mM EDTA. The retention of 
DNA -protein complex was quantified by scintillation 
counting and revealed that the binding had a 

15 dissociation constant in the 100 picomolar range . The 
binding was not affected by preincubation of the 
nuclear extract with a molecular excess of a specific 
competitor DNA. 

20 Example 3. construction of Deletion Mutants 

The previous two examples demonstrate that the Bt 
I CP coding sequence in a chimaeric plant gene 
negatively affects the cytoplasmic Bt I CP mRNA level 
directed by the chimaeric plant gene. It is shown that 

25 this negative control is not at the level of 
transcription initiation but at least in part due to a 
reduced ability of RNA polymerase II to generate 
precursor Bt I CP mRNA. A deletion analysis of the 
chimaeric Bt I CP plant gene was performed to identify 

30 whether impaired transcription elongation is the 
exclusive mechanism by which the Bt ICP sequence 
interferes with gene expression. The rationale of the 
experiment is that the introduction of specific 
deletions in the Bt ICP coding region could remove or 
inactivate the sequence element (s) responsible for the 
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negative control. As a result such mutant gene would 
direct an increased level of cytoplasmic mRNA. This 
method can therefore be used to map and identify the 
sequence (s) involved in the negative control. 
5 To perform this analysis, a deletion series of the 

bt860-neo gene (Vaeck et al., 1987) was Bade. Fig. 3 
gives a schematic representation. The resultant 
deletion derivatives do not specify a Bt ICP and 
therefore are assayed at RNA level only. In order to 

10 ob tain accurate Bt ICP mRNA concentration values, the 
deletion mutants were compared in a transient 
expression system using tobacco leaf protoplasts of SRI 
(Cornelissen and Vandewiele, 1989). The relative mRNA 
abundances were calculated using a correction factor 

15 provided by the mRNA level specified by the cat 
reference gene present on the same plasmid as the 
mutant Bt ICP gene. Four hours after introduction of 
the genes the tobacco leaf protoplasts were harvested, 
and total RNA was prepared and analysed (fig. 4) . 

20 The mutants nos. 50-60 (fig. 3) show that 

progressive deletions of the carboxy -terminal part or 
the amino-terminal part of the Bt ICP coding sequence 
result in a gradually increasing neo transcript level. 
As there are not very abrupt changes in transcript 

25 levels> these results suggest that the low transcript 
level produced by full length Bt ICP genes is 
controlled by a number of signals. Deletions within the 
Bt ICP coding sequence indeed did not localise a 
specific sequence element which, by itself, is 

30 responsible for the low Bt ICP mRNA level. Similarly, 
cloning of fragments of the Bt ICP coding sequence in 
pJD50 (fig. 3) did not allow identification of such a 
region . 

The relative transcript levels were plotted 
against the length of the Bt ICP sequence present in 



WO 91/16432 



PCT/EP9 1/00733 



32 



the different deletion derivatives. Fig. 5 suggests 
that hybrid Bt ICP-neo transcript levels drop with 
increasing length of the Bt ICP sequence. In this 
respect, the mutants nos. 61-66 (fig. 3) form M 
exception as they show in average a low transcript 
level relative to the length of the Bt ICP sequence. 

These results show that the low transcript levels 
of Bt ICP plant genes in tobacco are not exclusively 
due to an impaired elongation of the nascent transcript 
but that a number of signals operate to cause a reduced 
expression capacity of the chimaeric Bt ICP gene. 

Example 4 » 

To determine whether cytoplasmic events are 
important in causing inefficient expression of the bt2 
gene in plants, the following test was carried out. 
Cytoplasmic bt2 mRNA steady state levels in transgenic 
leaf protoplasts of K28-220 are normally found to be 
below 1 transcript per cell. The steady state level is 
determined by, and is proportional to, the number of 
bt2 transcripts entering per time unit the cytoplasm 
and the cytoplasmic half-life of the transcript. When 
steady state levels are achieved, the absolute numbers 
of transcripts entering and leaving the cytoplasmic bt2 
mRNA pool are equal. Therefore, the cytoplasmic half- 
life and cytoplasmic steady state level of the bt2 
transcript will reveal whether its cytoplasmic steady" 
state level is due to a relatively low import of bt2 
transcript, a relatively high turnover (i.e., 
conversion to a protein) rate, or a combination of 
both. 

The cytoplasmic turnover of bt884 transcripts was 
determined according to Gallie et al. (1989) . a capped 
and polyadenylated synthetic bt884 mRNA was produced in 
vitro according to protocols of Promega Corporation 
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(Madison, Wisconsin, USA) and introduced into tobacco 
leaf protoplasts simultaneously with a synthetic bar 
(De Block et al. , 1987} mRNA. The two synthetic 
transcripts differed only in their coding sequences. At 
various times after RNA delivery, samples were taken, 
and total RNA was isolated. Northern analyses revealed 
that the half-lives (T 1/2) of the synthetic bt884 and 
bar transcripts were about 8+3 hours and 5 + 2 hours, 
respectively. See Table 1, below. These data show that 
the bt884 coding sequence , more particularly the bt884 
cod on usage and the AU-rich motifs in the bt884 coding 
sequence, do not render the btB84 mRNA more unstable 
than the bar mRNA which is known to accumulate in the 
cytoplasm to about 1000 transcripts per tobacco leaf 
protoplast (calculated from Cornelissen, 1989) . The low 
cytoplasmic steady state level of the bt884 transcripts 
is, therefore, caused by a lack of import of 
transcripts into the cytoplasm. Thus, the expression 
defect of the btSB4 gene has to be restored by 
introduction of modifications in the bt884 coding 
sequence that improve the expression pathway in the 
nucleus. 

Expression of the bt!4 , bt!5 and bt!8 genes in 
tobacco revealed that these genes also direct low 
cytoplasmic mRNA steady state levels. Therefore, a 
similar analysis was carried out with synthetic btl4 , 
bt!5 and bt!8 transcripts to identify whether the 
expression defect had a cytoplasmic or nuclear 
character. Table 1, below, shows that all three 
transcripts behave as stable mRNAs in the cytoplasm of 
tobacco leaf protoplasts. Therefore , bt!4 , bt!5 and 
bt!8 genes , like the bt884 gene, must be deficient in 
exporting high levels of bt transcript to the 
cytoplasm, and to improve the expression of such genes, 
it is necessary to modify their coding sequences so 
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that nuclear events, which interfere with efficient 

gene expression, are avoided or ameliorated. 

Table 1 

5 

Half-life determination of synthetic bt and bar mRNAs 
in Nicotiana tabacum cv. Petite Havanna SRI leaf 
protoplasts 



10 



15 



Example 


l st mRNA 


Tl/2 


2 nd mRNA 


Tl/2 






(Hours) 




(Hours) 


A 


bt884 


8+/"3 


bar 


5+/ -2 


B 


bt!4 


7+/"2 


bar 


6+/"3 


C 


bt!5 


12+/-5 


bar 


21+/-12 


D 


bt!8 


10+/-5 


bar 


12+/-5 
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The synthetic bar transcripts had a length of 783 bases 
and included a cap, the TKV leader (77 bases, Danthinne 
and Van Emmelo, 1990) , the bar coding sequence (552 
bases ; De Block et al . , 1987) , a trailer of 52 
nucleotides consisting of the bases GADCA CGCGA AUU and 
39 bases from the pGEM-3Z (Pr omega) poly linker (Kpnl 
(T4 DNA pol . ) -Hindlll (T4 DNA pol.) , and a poly (A) of 
the composition (A) ^G ( A) (A) 32 , followed by the 
nucleotides GCU. 

The synthetic bt884 transcripts had a length of 2066 
bases and included a cap, the TMV leader (77 bases) , 
the bt884 coding sequence followed by the trailer until 
the Klenow treated PstI site (1843 nucleotides) , the 
trailer continued with AAUUC CGGGG ADCAA UU, 39 bases 
of the pGEM-3Z poly linker and the (A) 33 G(A) 32 G(A)2 1 
poly (A) , followed by the nucleotides CG. 



The synthetic bt!4 transcripts had a length of 2289 
bases and included a cap,, the TKV leader (77 bases) , 
the bt!4 coding sequence till the Klenow treated Bell 
site (2023 bases) , plus 26 supplementary nucleotides CG 
UCG ACC UGC AGC CAA GOT UGC UGA, a trailer starting 
with UUGAU UGACC GGAUC CGGCU CUAGA AUU, followed by 39 
bases of the pGEM-3Z poly linker, and the 
(A) 33 G(A) 32 G(A) 21 poly (A) , followed by the nucleotides 
CGGUA CCC. 

The synthetic bt!5 transcripts had a length of 2198 
bases and included a cap, the TMV leader (77 bases) the 
bt!5 coding sequence as in pVE35 (PCT publication 
WO90/15139) followed by the trailer till the Klenow 
treated BamHI site (1989 bases) , the trailer then 
continued with AAUU, 39 bases of the pGEM-3Z polyl inker 
and the (A) ^G (A) 32 G (A) 21 poly (A) , followed by the 
nucleotides CG. 

The synthetic bt!8 transcripts had a length of 2184 
bases and included a cap, the TKV leader (77 bases) the 
bt!8 coding sequence until the Klenow treated BcLI site 
(1918 bases) , followed by 26 nucleotides until the 
translation stop CG UCG ACC UGC AGC CAA GCU UGC UGA, a 
trailer starting with UUGAU UGACC GGAUC GAUCC GGCUC 
AGAUC AAUU, 39 bases of the pGEM-3Z polylinker and the 
(A) 33 G(A) 32 G (A) 21 poly (A) , followed by the nucleotides 
CG. 

Example 5. Construction of Modified Bt ICP Genes 

Examples 1-4 show that the expression in a plant 
of a Bt ICP gene is negatively affected by the Bt ICP 
coding sequence at both transcriptional and post- 
transcriptional levels, but principally by nuclear 
events . These examples also show that the control of 
expression is not confined to a specific DNA sequence 
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within the Bt I CP coding sequence. Instead, the 
negative effect on gene expression is an intrinsic 
property of the Bt ICP coding sequence. On this basis, 
it is believed that, by directed change of the DNA 
sequence of the Bt ICP coding region, an improvement of 
gene expression will occur. The improvement will be of 
a cumulative type as the negative influence of the Bt 
ICP coding region is spread over the complete coding 
sequence. Similarly, an improvement of gene expression 
will be obtained by reduction of the length of the Bt 
ICP coding sequence. This improvement will have a 
cumulative effect if used in combination with 
modifications of the Bt ICP coding region. 

Therefore, two types of modifications were 
introduced into a Bt ICP (i.e., bt2) coding sequence 
which , as will be shown, indeed resulted in a 
significant increase in Bt ICP plant gene expression. 
First, the DNA sequence was modified in the central 
region of the toxic core fragment of the Bt I CP as 
transcription elongation is impaired in this region. 
Secondly, the length of the Bt ICP coding sequence was 
reduced as the negative influence is proportional to 
the length of the Bt ICP coding sequence. A detailed 
description of the mutations is given in figs. 6a, b 
and c. As shown in fig. 7, the modifications change the 
AT-content of the chimaeric Bt ICP gene significantly. 
The modifications change the primary DNA structure of 
the Bt ICP coding sequence without affecting the amino 
acid sequence of the encoded protein. It is evident 
that, if more DNA mutations were to be introduced into 
the Bt ICP coding sequence, a further improvement of 
gene expression would be obtained . 

To determine the effect of the modifications, the 
expression properties of the modified BtlCP gene and 
the parental bt860-neo gene were compared in a 
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transient expression system as described by Cornelissen 
and Vandewiele (1989), and Denecke et al . (1989) . 
Basically, the accumulation profiles of the genes under 
study were compared by relating their profiles to the 
5 profile of a reference gene present in the same 
experiment. Fig. 8a shows the vectors used in the 
assay, and fig. 8b shows that the accumulation of the 
reference CAT protein is nearly identical in both 
experiments . It is not possible to measure the 

10 accumulation of Bt I CP encoded by the parental 
bt860-neo gene, but the modified Bt I CP gene clearly 
directs an increased synthesis of Bt ICP. 

These results demonstrate that mutation of the Bt 
ICP coding sequence relieves the negative influence of 

15 the Bt ICP coding sequence on the expression of a Bt 
ICP plant gene. 

Example 6. Cloning and Expression of Modi fi ad BT ICP 
Genes in Tobacco and Potato Plants 

20 Using the procedures described in US patent 

application 821,582, filed January 22 , 1986 , and EPA 
86300291.1, EPA 88402115.5 and EPA 89400428.2, the 
modified Bt ICP (i.e. , bt2) genes of figs. 6 and 7 are 
inserted into the intermediate T-DNA vector , pGSH1160 

25 (Deblaere et al. , 1988) between the vector's T-DNA 
terminal border repeat sequences . 

To obtain significant expression in plants, the 
modified Bt ICP genes are placed under the control of 
the strong TR2 ' promoter (Velten et al. , 1984) and are 

30 fused to the transcript 3 ' end formation and 
polyadenylation signals of the T-DNA gene 7 (Velten and 
Schell , 1985) . 

In addition, the translation initiation context or 
site are changed in accordance with the Joshi consensus 
sequence (Joshi, 1987) in order to optimize the 
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translation initiation in plant cells. To this end, an 
oligo duplex (figs. 6a and 6b) is introduced to create 
the following sequence at translation initiation site: 
AAAACCATGGCT . In this way, an additional codon (i.e., 
GCT) coding for alanine is introduced. Additionally, 
Kpnl and BstXI sites are created upstream of the ATG 
translation initiation codon. 

Using standard procedures (Deblaere et al. , 1985) , 
the intermediate plant expression vectors , containing 
the modified BtlCP gene, are transferred into the 
Agrobacterium strain C58C1 Rif* (US patent application 
821,582; EPA 86300291.1) carrying the disarmed Ti- 
plasmid pGV2260 (Vaeck et al. , 1987) . Selection for 
spectinomycin resistance yields cointegrated plasmids, 
consisting of pGV2260 and the respective intermediate 
plant expression vectors . Each of these recombinant 
Agrobacterium strains is then used to transform 
different tobacco plant cells (Nicotiania tabacum) and 
potato plant cells ( S planum tuberosum ) so that the 
modified Bt ICP genes are contained in, and expressed 
by, different tobacco and potato plant cells. 

The transgenic tobacco plants containing the 
modified Bt ICP genes are analyzed with an ELISA assay. 
These plants are characterized by a significant 
increase in levels of Bt (Bt2) proteins, compared to a 
transgenic tobacco plant containing a non-modified Bt 
ICP (bt2) gene. 

The insect icidal activity of the expression 
products of the modified Bt ICP (bt2) genes in leaves 
of transformed tobacco and potato plants is evaluated 
by recording the growth rate and mortality of larvae of 
Tobacco hornworm ( Kanduca sexta) , Tobacco budworm 
(Heliotis virescens) and potato tubermoth (Phthorimaea 
operculella) fed on leaves of these two types of 
plants. These results are compared with the growth rate 
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of larvae fed leaves from tobacco and potato plants 
transformed with the unmodified or parental Bt ICP 
(bt2) gene and from untrans formed potato and tobacco 
plants. Toxicity assays are performed as described in 
EPA 88402115.5 and EPA 86300291.1. 

A significantly higher mortality rate is obtained 
among larvae fed on leaves of transformed plants 
containing and expressing the modified Bt ICP genes . 
Tobacco and potato plants containing the modified Bt 
ICP genes show considerably higher expression levels of 
Bt I CPs compared to tobacco and potato plants 
containing the unmodified Bt ICP gene. 

The insecticidal activity of three transgenic 
tobacco plants containing the modified Bt ICP genes is 
determined against second and third ins tar larvae of 
Heliothis virescens . The control plant was not 
transformed. The results are summarized in Table 2, 
below. 



2 0 Table 2 



% mortality of insects (recorded after 
5 days) 



Control 11 

No. 1 100 

No. 2 88.5 

No. 3 100 

Needless to say, this invention is not limited to 
tobacco and potato plants transformed with the modified 
Bt ICP gene. It includes any plant, such as tomato , 
alfalfa, sunflowers , corn, cotton , soybean, sugar 
beets, rapeseed, brass icas and other vegetables, 
transformed with the modified Bt ICP gene. 
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Nor is the invention limited to the use of 
A3 rob a cterium t uaefa ciens Ti-plasmids for transforming 
plant cells with a modified Bt ICP gene. Other known 
techniques for plant transformation, such as by means 
of liposomes, by electroporation or by vector systems 
based on plant viruses or pollen, can be used for 
transforming mon ocoty 1 edonons and dicotyledons with 
such a modified Bt ICP gene. 

Nor is the invention limited to the bt2 gene, but 
rather encompasses all Cry I, Cry II, crylll and Cry IV 
Bt ICP genes . 
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Claims 

1. A process for modifying a Bt ICP gene to improve 
its expression in a plant cell, transformed with the 

5 gene ? the process comprising the step of: changing A 
and T sequences in a plurality of translational codons 
of the gene to corresponding G and C sequences encoding 
the same amino acids, so as to improve the gene's 
transcription to an mRNA, the nuclear accumulation of 
10 the mRNA and/ or the nuclear export of the mRNA, 
particularly the gene 1 s transcription, in the plant 
cell. 

2. The process of claim 1 for modifying a Bt ICP gene 
15 to improve its transcription in plant cells, 

transformed with the modified gene, wherein the 
plurality of translational codons is at least one 
region of the gene which, during transcription, has 
thereon a relatively low percentage of RNA polymerase 
20 II of the plant cell as compared to another adjacent 
upstream region of the gene. 

3. The process of claim 1 or 2, wherein the Bt ICP 
gene encodes a Bt insecticidal crystal protein 
truncated towards a trypsin cleavage site, preferably 

25 at both its C-terminal and N-terminal ends, and 
preferably encoding a portion of the protein of about 
60 - 80 kDa, particularly the toxin of the protein. 

4. The process of anyone of claims 1-3, wherein A and 
30 T sequences of at least 3 codons are changed to G and C 

sequences at a translation initiation site of the gene 
and A and T sequences of at least about 3, preferably 
at least about 10, especially at least about 33, codons 
are changed to G and C sequences in a second region of 
the gene, preferably affecting transcription 
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elongation, downstream of the translation initiation 

site . 

5. The process of anyone of claims 1-4, wherein A and 
5 T sequences of at least about 3, preferably at least 

about 10, especially at least about 33, codons are 
changed to G and C sequences in a third region of the 
gene, preferably affecting cytoplasmic RNA 
concentration. 

10 

6. The process of claim 4 or 5 , wherein A and T 
sequences of at least about 3 codons are changed to G 
and C sequences at a translation termination end of the 
gene. 

15 7. The process of anyone of claims 4-6, wherein the 
gene is a cry I gene, such as a bt2 , bt!4 , bt!5 or bt!8 
gene, preferably a bt2 gene, or a gene having 
substantial sequence homology thereto . 

20 8. The process of claim 7 wherein the gene is a bt2 
gene ; the second region being between about nucleotides 
674 and 1000 and A and T sequences of about 59 or more 
codons are changed to G and C sequences in the second 
region, particularly between about nucleotides 73 3 and 

25 1000, quite particularly between about nucleotides 765 
and 794. 

9. The process of anyone of claims 1-8, wherein the 
gene is further modified by substituting for its ATG 

30 translation initiation site : AAAACCATGGCT . 

10. The modified Bt ICP gene obtained by the process 
of anyone of claims 1-9 . 
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11. A chimaeric gene for transforming a cell of a 

plant, comprising the following operably- linked DNA 
fragments in the same transcriptional unit: 

5 a) the modified Bt ICP gene of claim 10; 

b) a promoter capable of directing expression of 
the modified Bt ICP gene in the plant cell; and 

c) transcript 3 » end formation and polyadenylation 
signals suitable for expressing the modified Bt 
ICP gene in the plant cell. 

12. The plant cell of claim 11, transformed with the 
chimaeric gene of claim 11. 

15 

13. A plant, plant tissue or plant cell culture 
consisting of the plant cells of claim 12. 

14. A seed of the plant of claim 13. 

20 15. A vector, preferably a Ti-plasmid, for stably 
transforming the nuclear genome of a plant, comprising 

the chimaeric gene of claim 11. 

16. A process for protecting the plant of claim 10 
25 against an insect pest, comprising the step of: 

transforming the genome of the plant with the chimaeric 
gene of claim 11. 

17. A process for modifying a foreign gene whose rate 
and/ or level of expression in a plant cell, transformed 

30 with the gene, is substantially limited by the rate 
and/ or level of nuclear production of an mRNA encoded 
by the gene ; the process comprising the step of; 
changing A and T sequences in a plurality of 
translational codons of the gene, particularly in a 
plurality of translational codons in at least one 
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region of the gene which, during transcription, has 
thereon a relatively low percentage of RNA polymerase 
II of the plant cell as compared to another adjacent 
upstream region of the gene ; the A and T sequences 
being changed to corresponding G and C sequences 
encoding the same amino acids, so as to improve the 
gene ' s transcription to the mRNA, the nuclear 
accumulation of the mRNA and/ or the nuclear export of 
the mRNA, particularly the gene • s transcription to the 
mRNA, quite particularly the transcript elongation by 
RNA polymerase II on the gene, in the plant cell. 
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FIGURE 3(C0NT a ) 
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Figu re 6a (c ant.) 



Linear LENGTH- 52 



OLIGOPS15- 

-OLIGOPS16' 



3) OLIGOPS16' , 4) OLIGOPSI5, 



Name Base 

3 1 ATCGGTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGAT 

4 1 GTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGATAT CG 

CON 1 ATCGGTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGATAT CG 
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ATC GGT ACC AAA ACC ATG GCT ATC GAG ACC GGT TAC ACC CCA ATC GAT ATC G 
MET Ala He Glu Thr Gly Tyr Thr Pro lie Asp He 



SUBSTITUTE SHEET 



WO 91/16432 



15/26 



PCT/EP9 1/00733 



r . f < » A' < ' > -trnt'rt - trr, -rt'.f TAMi'^ ' "CH C77 (.-!. 

0LIGO2 

ATCGTGTCCCTATTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGA 
0LIG03 

GATCTACACCAACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCTCAGGGCATCG 

OLIGO** 

AGGG AAGCATC AGGAGCCCACACTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCT 
0LIGO5 

CACAGGGGAGAGTACTACTGGTCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAATTCG 



0LIGO6 

GATCCGAATTCTGGCCCCGAGAACCCCACAGGGGAAGCCATGATCTGGTGCCCGGACCAGTAGTAC 
0LIGO7 

TCTCCCCTGTGAGCGTCCGTGTAGATGGTGATGCTGTTAAGGATGTCCATCAAGTGTGGGCTCCT 
0LIG08 

CATGCTTCCCTCGATGCCCTGAGCCGAGCCTCGGAAGCTACCGTCGAAGTTCTCTAACACTGGG 
0LIG09 

TTGGTGTAGATCTCCCTGGTTAACTGGGACACGGTTCGGATTGGGTACGTCCTGCTGTCGTAGTTCGGGAA 
0LIG01O 

TAGGGA CAC GATGTCTAACACGGTTAGGGTTAACTCCCTCCTGAACTGGTTGTACCTGATCC AGTCTCTAGAG 
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149 213 278 

77 148 212 277 

66 137 202 267 343 

65 78 136 201 266 339 

^„««.««^„„„„«„„„„.^«„^ 

— 0LIGO1 0 ' « OL1G09 ' <-- 0LIGO8 ' « — 0LIG07 ' < — 0LIG06' < 

— 0LIG01 > 0LIGO2 > OLIG03 > 0LIG04 > OL1 G05 > 



Name Base 

1 1 GATCCTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT 

10 1 CTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT 

CON 1 GATCCTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT 

1 61 TAGAC 

10 57 TAGACATCGT GTCCCTA 

2 1 ATCGT GTCCCTATTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT 
9 1 TTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT 

CON 61 TAGACATCGT GTCCCTATTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT 

2 56 CCCAGTTAAC CAGGGA 

9 4 4 CCCAGTTAAC CAGGGAGATC TACACCAA 

3 1 GATC TACACCAACC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC 
8 1 CC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC 

CON 121 CCCAGTTAAC CAGGGAGATC TACACCAACC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC 

3 4 5 GAGGCTCGGC TCAGGGCATC G 

8 33 GAGGCTCGGC TCAGGGCATC GAGGGAAGCA TC 

4 1 AGGGAAGCA TCAGGAGCCC ACACTTGATG GACATCCTTA 
7 1 AGGAGCCC ACACTTGATG GACATCCTTA 

CON 181 GAGGCTCGGC TCAGGGCATC GAGGGAAGCA TCAGGAGCCC ACACTTGATG GACATCCTTA 

4 4 0 ACAGCATCAC CATCTACACG GACGCT 

7 29 ACAGCATCAC CATCTACACG GACGCTCACA GGGGAGA 

5 1 CACA GGGGAGAGTA CTACTGGTCC GGGCACCAGA 

6 1 GTA CTACTGGTCC GGGCACCAGA 

CON 241 ACAGCATCAC CATCTACACG GACGCTCACA GGGGAGAGTA CTACTGGTCC GGGCACCAGA 

5 35 TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCG 

6 24 TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCGG ATC 



TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCGG ATC 
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21 54 
GAT CCT CTA GAG ACT GGA TCA GGT ACA ACC AGT TCA GGA GGG AGT TAA CCC TAA 
Ser Ser Arg Asp Trp lie Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr 

81 108 
CCG TGT TAG ACA TCG TGT CCC TAT TCC CGA ACT ACG ACA GCA GGA CGT ACC CAA 
Val Leu Asp lie Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro He 

135 162 
TCC GAA CCG TGT CCC AGT TAA CCA GGG AGA TCT ACA CCA ACC CAG TGT TAG AG A 
Arg Thr Val Ser Gin Leu Thr Arg Glu He Tyr Thr Asn Pro Val Leu Glu Asn 

189 216 
ACT TCG ACG GTA GCT TCC GAG GCT CGG CTC AGG GCA TCG AGG GAA GCA TCA GGA 
Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly lie Glu Gly Ser He Arg Ser 

243 270 
GCC CAC ACT TGA TGG ACA TCC TTA ACA GCA TCA CCA TCT ACA CGG ACG CTC ACA 
Pro His Leu MET Asp He Leu Asn Ser lie Thr lie Tyr Thr Asp Ala His. Arg 

297 324 
GGG GAG AGT ACT ACT GGT CCG GGC ACC AGA TCA TGG CTT CCC CTG TGG GGT TCT 
Gly Glu Tyr Tyr Trp Ser Gly His Gin He MET Ala Ser Pro Val Gly Phe Ser 

CGG GGC CAG AAT TCG GAT C 
Gly Pro Glu Phe Gly 
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AA ArGGATAAATAGCCrrGCTTCCTATTA'l ATCTTCCCAAATTACCAATACATTACACTAGCATC*f GAAT 
TTCATAACCAATCTCGATACACCAAATCGATGGATCCCGATAACAATCCGAACATCAATGAATGCATTCC 
TTATAATTGTTTAAGTAACCCTGAAGTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCA 
ATCGATATTTCCTTGTCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAG 
GACTAGTTGATATAATATGGGGAATTTTTGGTCCCPCTCAATGGGACGCATTTCTTGTACAAATTGAACA 
GTTAATTA^CC^AAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTAGAAGGACTAAGCAAT 
CTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGATCCTACTAATCCAGCATTAAGAGAAG 
AGATGCGTATTCAATTCAATGACATGAACAGTGCCCTTACAACCGCTATTCCTCTTrTTGCAGTTCAAAA 
TTATCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTT 
TCAGTGTTTGGACAAAGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATT7AACTAGGC 
TTATTGGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGACCGGA 
TTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTATTAGATATCGTTTCT 
CTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTTTCCCAATTAACAAGAGAAATTTATA 
CAAACCCAGTATTAGAAAATTTTGATGGTAGTTTTCGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAG 
GAGTCCACATTTGATGGATATACTTAACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATATTAT 
TGGTCAGGGCATCAAATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATG 
GAACTATGG3AAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGAACATT 
ATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTATCTGTTCTTGACGGG 
ACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTATACAGAAAAAGCGGAACGGTAGATT 
CGCTGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGATTTAGTCATCGATTAAGCCA 
TGTTTCAATGTTTCGTTCAGGCOTAGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGG 
ATACATCGTAGTGCTGAATTTAATAATATAATTCCTTCATCACAAATTACACAAATACCTTTAACAAAAT 
CTACTAATCTTGGCTCTGGAACOTCTCTCGTTAAAGGACCAGGArTTACAGGAGGAGATATTCTTCGAAG 
AACTTCACCTGGCCAGATTTCAACCTrAAGAGTAAATATTACTGCACCATTATCACAAAGATATCGGGTA 
AGAA.TTCGCTACGCTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCTATTAATCAGG 
GGAATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCrTTAGGACTGTAGGTTTTAC 
TACTCCGTTTAACTTTTCAAATGGATCAAGTGTATTTACGrTAAGTGCTCATGTCTTCAATTCAGGCAAT 
GAAGTTTATATAGATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAA 
GAGCACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATGTGACGGA 
TTATCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTTTGTCTGGATGAAAAAAAA 
GAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAAXXXXXCCTCGA3CTTGGATG 
GATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAAT 
CGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTrTTTGTCAAGACCGAC 
CTGTCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTC 
CTTGCGCAGCTGTGCTrCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGG 
GCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGG 
CTGCATACGCTTGATCCGGCTACCTGCCCArrCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTA 
CTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGA 
ACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGC 
TTGCCGAATATGATGGTGGAAAATGGCCGCTTTTCTGGATT C ATCGA CTGTGGCCGG CTG G ST GTGGCGG 
ACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCG 
CTTCCTCGTGCTTTACGGTATCGCCGOTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAG 
TTCTTCTGACAGATCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGC 
ACTCAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATATTTCTGAATTTAAAC 
TTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATTTTATCAATAAATATTTA 
A A CT AT ATTTCTTTCAAGATGGG AATT A A C ATCTAC AAATTGCCTTTTCTT 

The ATG initiation codon and the TGA stop codon are underlined. 
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AAATGGATAAATAGCCraGCTTCCTATTATATCTTC€CAAATTACCAATACATTACACTAGCATCTGAAT 
TTCATAACCRATCTCGATACACCAAATCGGTACCAAAAC CATG GCTATCGAGACCGGTTACACCCCAATC 
GATATTTCCTTGTCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGAC 
TAGTTGATATAATATGGGGAATTTTTCGTCCCTCrCAATGGGACGCATTTCTTGTACAAATTGAACAGTT 
AATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGAl'TAGAAGGACTAAGCAATCTT 
TATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGATCCTACTAATCCAGCATTAAGAGAAGAGA 
TGCGTATTCAATTCAATGACATGAACAGTGCCCTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTA 
TCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCAAATrrACATTTATCAGTTTTGAGAGATGTTTCA 
GTGTTTGGACAAAGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCTTA 
TTGGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGACCGGATTC 
TAGAGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTAACCGTGTTAGACATCGTGTCCCTA 
TTCCCGAACTACGACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGrTAACCAGGGAGATCTACACCA 
ACCCAGTGlTAGAGAACTTaSACGGTAGCTTCCGAGGCTCGGCTCAGGGCATCGAGGGAAGCATCAGGAG 
CCCACACTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCTCACAGGGGAGAGTACTACTGG 
TCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAATTCACTTTTCCGCTATATGGAA 
CTATGGGAAATGCAGCTCCACAACAACGTATTGTrcCTCAACTAGGTCAGGGCGTGTATAGAACATTATC 
GTCCACm'ATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTATCTGTTCTTGACGGGACA 
GAATTTGCTTATGGAACCTCCTGAAATTTGCCA 

TGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGT 

TTCAATGTTTCGTTCAGGCTTTAGTAATAGTAGTGTAAGTA7AATAAGAGCTCCTATGTTCTCTTGGATA 

CATCGTAGTGCTGAATTTAATAATATAATTCCrrCATCACAAATTACACAAATACCTTTAACAAAATCTA 

CTAATCTTGGCTCTGGAACTTCTGrCGTTAAAGGACCAGGArTTACAGGAGGAGATATTC'rTCGAAGAAC 

TTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTATCACAAAGATATCGGGTAAGA 

ATTCGCTACGCTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCTATTAATCAGGGGA 

ATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTACTAC 

TCCGTTTAACOTCTCAAATGGATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAATGAA 

GTTTATATAGATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAG 

CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATGTGACGGATTA 

TCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAA'TTTTGTCTGGATGAAAAAAAAGAA 

TTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAAXXXXXCCTCGAGCTTGGATGGAT 

TGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGG 

CTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTG 

TCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTT 

GCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCA 

GGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTG 

CATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAG CACGTACTC 

GGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACT 

GTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTG 

CCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACC 

GCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTT 

CCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 

TTCTGACAGATCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGCACT 

CAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATATTTCTGAATTTAAACTTG 

CATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACrTGTTATrTTATCAATAAATATTTAAAC 

TATATTTCTTTCAAGATGGGAATTAACATCTACAAATTGCCTTTTCTTATCGACCATGTACGGGTACCGA 

GCT CG AATT CCT A CGCAG CAGGTCT CAT CAAGA CGATCTA CCCG AGTA A CA 

the ATG initiation cod on and the TGA stop codon are underlined. 
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Figure 8a 




30,20 



1 : Aatll 

2 : AccI 

3 : Aflll 

4 : Alvt.T 

5 : Apal 

6 : Avrll 

7 : BarfTI 



8 : 


Bell 


15 


EcoKV 


22 


NarX 


29 


RsrII 


9 : 


Bsj*2I 


16 


Eco31I 


23 


Ndel 


30 


Sad 


10 


s BssHII 


17 


Espl 
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Nhel 


31 
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: BstEII 
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HLndll 


25 


Nsil 


32 




12 


: BstXI 
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HindlH 


26 


PfLMI 


33 


Tth111I 
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: EagI 
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Kpnl 


27 


PpiiXI 


34 
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Figure 8a (cont. ) 




1 : Aatll 

2 : AccI 

3 : Aflll 

4 : AIvjNI 

5 : Apal 

6 : Avrll 

7 : BanKI 



8 : Bell 

9 : BspMTI 

10 : BssHII 

11 : BstEII 

12 : BstXI 

13 : EagI 

14 : EccNI 



15 : EcoKV 

16 : Eco31I 

17 : Espl 

18 : Hindll 

19 : Hirrflll 

20 : Kpnl 

21 : Nael 



22 : Narl 

23 : Ndel 

24 : Nhel 

25 : Nsil 

26 : PflMI 

27 : PpuMI 

28 : Pvul 



29 : RsrII 

30 : Sac I 

31 : Snel 

32 : 

33 : Tthllll 

34 : xcal 

35 : Xhcl 
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